Analysing Effect of Database Grouping on Multi-Database Mining

Authors

  • Animesh Adhikari
  • Lakhmi C. Jain
  • Sheela Ramanna
Abstract

Many applications need to synthesize global patterns from multiple large databases, where the applications are independent of the characteristics of local patterns. The pipelined feedback technique (PFT) seems to be the most effective technique under the local pattern analysis (LPA) approach. The goal of this paper is to analyse the effect of database grouping on multi-database mining. For this purpose, we design a database grouping algorithm. We introduce an approach called non-local pattern analysis (NLPA), which combines the database grouping algorithm with the pipelined feedback technique for multi-database mining, and we evaluate its effectiveness. We conduct experiments on both real and synthetic databases. The experimental results show that non-local pattern analysis does not always improve the accuracy of mining global patterns in multiple databases.

Index Terms: Local pattern analysis, Multi-database mining, Non-local pattern analysis, Pipelined feedback technique, Synthesis of patterns

Related resources

A Survey on Document Clustering For Identifying Criminal

Crimes are a social nuisance and cost our society dearly in several ways. Crime investigation is a very significant function of the police system in any country. A good crime analysis tool is required to identify crime patterns quickly and efficiently for future crime pattern detection. This paper presents a combined approach of clustering, outlier detection and providing the rule engine to iden...

Geometric clustering models for multimedia databases

Recently, in the field of information retrieval, data mining, or Knowledge Discovery in Databases (KDD), has been intensively studied as a way to extract implicit useful information from large amounts of data. One of the important objectives of KDD is to obtain generalizations by grouping similar objects via clustering. In the case of multimedia databases such as full-text databases and image databases, geometri...

Data sanitization in association rule mining based on impact factor

Data sanitization is a process used to promote the sharing of transactional databases among organizations and businesses; it alleviates the concerns of individuals and organizations regarding the disclosure of sensitive patterns. It transforms the source database into a released database so that counterparts cannot discover the sensitive patterns and so data confidentiality is preserved ag...

Using Data Mining and Three Decision Tree Algorithms to Optimize the Repair and Maintenance Process

The purpose of this research is to predict the failure of devices using a data mining tool. To this end, a database of 392 records of ongoing failures at a pharmaceutical company in 1394 was first compiled. In the next step, 9 characteristics were analyzed, with the type of failure serving as the database class. In this regard, three decision tree algorithms have ...

Social Network Trend Analysis Using Frequent Pattern Mining and Self Organizing Maps

A technique for identifying, grouping and analysing trends in social networks is described. The trends of interest are defined in terms of sequences of support values for specific patterns that appear across a given social network. The trends are grouped using a SOM technique so that similar trends are clustered together. A cluster analysis technique is then applied to identify “interesting” tr...

Journal:
  • IEEE Intelligent Informatics Bulletin

Volume 12

Publication date: 2011